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Longitudinal studies of test-^wiseniss (TO) were Qonducted to determine: 
(a) the relationship between TW and grade level j (b) the ralatlonship between 
TW and sex, (c) the stability of TO, Aspects of TO observed included stem 
cue and specif ic determiner Identification and usage and the elimination of 
siTOil^^r and absurd options. Subjects were observed three times at two year 
Intervals in grades 5, 7, and 9 (n=75)| 6, 8, 10 (n^76)| 7, 9, 11 (n^73) | and 
8, 10, 12 (n*e4). 

Results showed low to moderate internal consistency for the 16 item TO 
scale and tost--retest correlations of approKimately the same magnitude as 
KR20 indicating TO was somewhat stable over the intervals observed* Sex by 
year MANOVA trend analysis showed no sgk effects and no seK by year Interact 
tlons* Significant (P<»05) linear trend was observed over all four-year 
intervals with a quadratic trend appearing in the 7, 9, 11 sainple. 
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Millffianj Bishop, and Ebel (1R65)^ define test-wlseness (TW) as "a 
subject's capacity to utilize the characteristics and formats of the test 
and/or test taking situation to receive a high scOTe." Ebel (1972) suggeste 
that TO is a concern K^hen the examinee is faced with novel itera types or 
when coimon item faults are present* . 

Previous re/-iarch has shorn that TO is net highly related to cognitive 
ability (Diamond and Ivans ^ 1972) and that TO can be learned (Gibb, 1964| 
Moore jSchutEj and Baker, 1966; Slakter, Koehler, and Hampton/ 1970b)/ 
Tlie relatlonshipa among TW, grade level and sex have been investl-^ 
gated in cross sectional (Slakter, Koehler, and Hampton, 1970a) and longi-- : 
tudinal studies (Crehan, Koehler, and Slakter, 1974). The longitudinal 
studies (Crehan et al*, 1974) involved observation of four aspects of TO 
for itudents in grades S through 9 at time one and repeated observations of 
the same stud en ti two years later in grades 7 through 11* Results indicated i 
(a) no sex by grade interaction, (b) increases In TO over all Intervals 
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except the grade 9 to 11 Interval , (c) little evidence for b^k differences 

In TWj and (d) TW remained relatively stable over the intervals obsarved. The 

present research reports the results of a third observation of TW following an 
fidditional two year interval. 

Method 

Ss were all available public sehool students in grades 9 through ' 
12 who had been observed on both the first and second occasions* The maKi^ 
mum possible sample slge was 391 and inatches were obtained for 288 Ss* 

The measurement Instruments included the same 16 TIJ items embedded 
In 28 legitimate items which varied to suit the particular grade level* 
The TW items are designed to measure four aspects of TW from among those 
listed by Millman ct al* (1965)* Four Items each are designed to measure 
the following TW behaviors: . 

1* recognize and select an option which resembles an aspect of 

the stem (stem-option)^ 
2, recognise and eliminate those options which contain specific 

deteirminers I e.g.| always, never ^ only (specif ic--determlners) * 
3# eliminate those options which are knoTO to be incorrect and 

select from among the remaining options (absurd-options) * and 
4* eliminate similar options and select from among the remaining 

altarnatives (sirailar-optiona) * 

The first two TW behaviors Involve cue using strategies and the second two 
Tt^ aspects Involve deductive reasonino* With the exception of the aspect 
of TW being obscrvcd>th© items were content free. That is > the keyed 
response could not be determined from knowledge of the nominal subject 
matter. Billow ara presented oKamplcs of items designed to measure each 
aspect of ttf and the basis for identification of the keyed responsep 



1. Stem-Op tioni 

The flying-spider Is known £qv Its ability toi 

1. blend in with its aurroundiftgs, 

* 2i glide through the air. 

3. kill Its prey with poison, 

4, make very large webs. 

Since the £ lylng-^spider does mt exist, the item cannot be answered from 
knowledge of content. However , the teat-wiEe eKamlnee Is e^^ected to 
recognlEe the correspondence betweea the stem-eye '-flying" and the option 
"glide through the air#" 

2* Specif ic-^De terminer s I 

The Japanese game ©f paduki: 



1. can only played by tha Impetrial Family. 

2. can never be played by t Saras. . 

* 3. is sometiTnes played indoors* 

4. is played at every celebration. 

Since the game of paduki does not e^ist and very few absolutes are universally 

true, the test-wise eKaminea is eKpected to eliminate options CDntaining 

specif ic-determiners and select froin among the remaining options. 

3. Absurd-Options: 

The greatest advjntage of using slent in the manufacture of steel 
Is that slent makes the steel: 

* !• stainless 
2# transparent 

3. heavy ; 

4. bulky 

The test-wise person is expected to eliminate options 2, 3, and 4 from con-! 
slderation and identify option 1 as the only plausible alternative. 
4i Similar-Options: / 
Mien Destor crystals are addod to water: 
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1. heat la given off. 

2. the teniperature of the solution rises. 

3. the container becomes warmer, 
* 4* the solution turns blue* 

Options 1, 2 J and 3 can be aliinlnated since they imply a similar result fifom 

adding non-existent Bestor crystals to wateri and therefore, option 4 is 

selected* 

Th& measurement instriiment was administered three times at two-year 
intervals to the Ss in their own classrooms by their own teachers using 
standardized procedures, Ss were led to believe that the instrument was an 
aptitude test and a part of the school* s regular^ testing program. 

Results and Discussion 
Table 1 presents the Kuder-Richardson Formula 20 (KR20) reilabilities 
for the crossHsectional data by grade. The Internal consistency of ' the TW 
results was not high for any grade* However, for a scale of only 16 items 
the KR20-S ware comparatively high and appeared to increase over grades 
with a median KR20 of *40 for grades 5 through S and a median KR20 of .65 
for grades 9 through 12, The two-and four--year test-'retest correlations 
(see Tablfe 2) are of approximately the same magnitude as the KR20 reliabilities 
giving indication that TW was a somewhat stable trait over the intervals 
observed, 

MeanSi standard deviations, and sample sizes by aex are reported in 
Table 3* Sex by year >IANOVA trend analysis resulted in no evidence of stable 
sex t^ifferences or sex by year interactions. Significant (Pi* 05) linear 
trends were observed over all of the four-year intervals with a significant _ 
quadratic trend over the 7 through 11 interval. The proportion of total sums 
of squares attributable to the linear trend was ,41 for the 5, 7, 9 group, 
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.35 for the 6, 8, 10 group,. .44 for the 7» 9, 11 group (,09 attributabla 
to the quadratic component for the 75 9, 11 group), and .37 for the 8, 10, 
12 group* In summary, TO appears to be somewhat stable and increasing 
over grades with no evidence of se^ differences or sex by grade interaction 

Given the present results, the generality of W across content areas 
(Milltnanj 1966) ^ and that TW cues are present in objective items on standard 
l^ed as well as teacher-raade tests^ then the following should be considered 
by parsons Involved with educational measurements! (a) the degree to which 
a test measures TO is related to the observed stability of the results of 
aptitude or achievement tests and (b) if TW is not controlled then eKamlnees 
low in TO will be penalized on objective test scores across subject areas 
both at a point in time and o%^er time. Since it is unlikely that TO cues ^ 
can be totally eliminated from objective tests It Is reconmiended that an 
attempt be made to control for individual differences In TW by teaching test- 
taking strategies to all students and ©specially those Identified as low in 
TW. 
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, TABLE 1 

^-20 Reliabilities for Cross-Sectional Observations of Ttf 



Grade 





5 


6 


7 


8 


9 


10 


11 


12 


Firsc 

Observation 


.40 


.18 


.44 


,40 


.63 


.60 


.72 




Second 
Observation 


.41 


.30 


.51 


.19 


.65 


.62 


.71 




Third 

Observation 










.65. 


.52 


.66 


.69 



tegt-HeCoaC CorrelaClgna (SEabllltlee) 
of Matched TW ObservaElofis Ove* Tw'o- and Feut-Year InCervals 
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f v.^ 1 
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2 Yf 
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2 Ycnr 


4 Year 


2 Yonr 


4 Yonr 


2' Year 


4 Ye/ir 


5-7 


7-9 




6-a 


8-10 


6-10 


7-i 




7-11 


8-10 


10-12 


8-12 


Halt 




.41 *3? ,37 


4a 


.04 .35 ai 


37 


.4a ,60 


,50 


23 


,66 


,67 


.57 ^ 




19 


.31 ,39 *50 


34 


,14 ,31 .37 


36 


*58 ,77 ,58 


41 


.43 ,5a .45 




TABLE 3 



Mean TH by Sex Within Grade for Matched 
Longitudinal Data (Standard Deviations In Parentheiai) 



First Second Third 

Sex W Gd. Observation Gd> Observation Gd. Observation 



Male 
Feinale 


46 
29 


5 


5.93 (2.27) 
5.69 (2.3b) 


7 


6.74 (2.62) 
7.31 (2.45) 


9 


8.22 (3.12) 
8.31 (2.77) 


Male 
Female 


42 
34 


6 


7.50 (1.76) 
6.97 (1.75) 


8 


8.50 (1.89) 
7.74 (1.94) 


10 


9.31 (2.79) 
9.29 (2.29) 


Male 
Female 


37 
36 


7 


6.57 (2.54) 
6.03 (2.71) 


9 


8.16 (2.80) 
8.22 (3.22) 


11 


8.73 (3.02) 
8.61 (2.90) 


Male 
Female 


23 
41 


8 


7.61 (2.33) 
7.56 (2.51) 


10 


8.61 (2.98) 
8.63 (2.95) 


12 


10.26 (3.63) 
9.41 (2.78) 
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